Homology (biology)

Forelimbs in mammals provide one example of homology.

A homologous[Etymology 1] trait is any characteristic of organisms that is derived from a common ancestor. This is contrasted to analagous traits: similarities between organisms that were not in the last common ancestor of the taxa being considered but rather evolved separately. As defined by Owen (1843), a homology is a "structural correspondence", whereas an analogy is a "non-correspondent similarity".

Whether or not a trait is homologous depends on both the taxonomic and anatomical level at which the trait is examined. For example, the bird and bat wing are homologous as forearms in tetrapods. However, they are not homologous as wings, because the organ served as a forearm (not a wing) in the last common ancestor of tetrapods.[1] By definition, any homologous trait defines a clade—a monophyletic taxon in which all the members have the trait (or have lost it secondarily); and all non-members lack it.[1]

A homologous trait may be homoplasious – that is, it has evolved independently, but from the same ancestral structure – plesiomorphic – that is, present in a common ancestor but secondarily lost in some of its descendants – or (syn)apomorphic – present in an ancestor and all of its descendants.[1]

A homologous trait is often called a homolog (also spelled homologue). In genetics, the term "homolog" is used both to refer to a homologous protein, and to the gene (DNA sequence) encoding it.

Contents

Homology of structures

The wings of pterosaurs (1), bats (2) and birds (3) are analogous as wings, but homologous as forearms.

Shared ancestry can be evolutionary or developmental. Evolutionary ancestry means that structures evolved from some structure in a common ancestor; for example, the wings of bats and the arms of primates are homologous in this sense. Developmental ancestry means that structures arose from the same tissue in embryonal development; the ovaries of female humans and the testicles of male humans are homologous in this sense.

Homology is different from analogy, which describes the relation between characters that are apparently similar yet phylogenetically independent. The wings of a maple seed and the wings of an albatross are analogous but not homologous (they both allow the organism to travel on the wind, but they didn't both develop from the same structure). Analogy is commonly also referred to as homoplasy, which is further distinguished into parallelism, reversal, and convergence.[2]

From the point of view of evolutionary developmental biology (evo-devo) where evolution is seen as the evolution of the development of organisms, Rolf Sattler emphasized that homology can also be partial. New structures can evolve through the combination of developmental pathways or parts of them. As a result hybrid or mosaic structures can evolve that exhibit partial homologies. For example, certain compound leaves of flowering plants are partially homologous both to leaves and shoots because they combine some traits of leaves and shoots.[3][4]

Homology of sequences in genetics

Homology among proteins and DNA is often concluded on the basis of sequence similarity, especially in bioinformatics. For example, in general, if two or more genes have highly similar DNA sequences, it is likely that they are homologous. But sequence similarity may also arise without common ancestry: short sequences may be similar by chance, and sequences may be similar because both were selected to bind to a particular protein, such as a transcription factor. Such sequences are similar but not homologous. Sequence regions that are homologous are also called conserved. This is not to be confused with conservation in amino acid sequences in which the amino acid at a specific position has been substituted with a different one with functionally equivalent physicochemical properties.

A sequence alignment of two proteins, produced by ClustalW

The phrase "percent homology" is sometimes used but is incorrect. "Percent identity" or "percent similarity" should be used to quantify the similarity between the biomolecule sequences. For two naturally occurring sequences, percent identity is a factual measurement, whereas homology is a hypothesis supported by evidence. One can, however, refer to partial homology where a fraction of the sequences compared (are presumed to) share descent, while the rest does not. For example, partial homology may result from a gene fusion event.

Many algorithms exist to cluster protein sequences into sequence families, which are sets of mutually homologous sequences. (See sequence clustering and sequence alignment.) Some specialized biological databases collect homologous sequences in animal genomes: HOVERGEN,[5] HOMOLENS,[6] HOGENOM.[7]

Homologous sequences are of two types: orthologous and paralogous.[8]

Orthology

Homologous sequences are orthologous if they were separated by a speciation event: when a species diverges into two separate species, the divergent copies of a single gene in the resulting species are said to be orthologous. Orthologs, or orthologous genes, are genes in different species that are similar to each other because they originated from a common ancestor. The term "ortholog" was coined in 1970 by Walter Fitch.

The strongest evidence that two similar genes are orthologous is the result of a phylogenetic analysis of the gene lineage. Genes that are found within one clade are orthologs, descended from a common ancestor. Orthologs often, but not always, have the same function.

Orthologous sequences provide useful information in taxonomic classification and phylogenetic studies of organisms. The pattern of genetic divergence can be used to trace the relatedness of organisms. Two organisms that are very closely related are likely to display very similar DNA sequences between two orthologs. Conversely, an organism that is further removed evolutionarily from another organism is likely to display a greater divergence in the sequence of the orthologs being studied.

Some other specialized biological databases provide tools to identify and collect orthologous sequences: OrthoMCL[9] for eukaryotes, OrthoMaM[10] for mammals, OrthologID[11] and GreenPhylDB[12] for plants.

Paralogy

Homologous sequences are paralogous if they were separated by a gene duplication event: if a gene in an organism is duplicated to occupy two different positions in the same genome, then the two copies are paralogous.

A set of sequences that are paralogous are called paralogs of each other. Paralogs typically have the same or similar function, but sometimes do not: due to lack of the original selective pressure upon one copy of the duplicated gene, this copy is free to mutate and acquire new functions.

Paralogous sequences provide useful insight to the way genomes evolve. The genes encoding myoglobin and hemoglobin are considered to be ancient paralogs. Similarly, the four known classes of hemoglobins (hemoglobin A, hemoglobin A2, hemoglobin B, and hemoglobin F) are paralogs of each other. While each of these genes serves the same basic function of oxygen transport, they have already diverged slightly in function: fetal hemoglobin (hemoglobin F) has a higher affinity for oxygen than adult hemoglobin. Function is not always conserved, however. Human angiogenin diverged from ribonuclease, for example, and while the two paralogs remain similar in tertiary structure, their functions within the cell are now quite different.

Paralogous genes often belong to the same species, but this is not necessary: for example, the hemoglobin gene of humans and the myoglobin gene of chimpanzees are paralogs. This is a common problem in bioinformatics: when genomes of different species have been sequenced and homologous genes have been found, one can not immediately conclude that these genes have the same or similar function, as they could be paralogs whose function has diverged.

Ohnology

Ohnologous genes are paralogous genes that have originated by a process of whole-genome duplication (WGD). The name was first given in honour of Susumu Ohno by Ken Wolfe.[13] Ohnologs are interesting for evolutionary analysis because they all have been diverging for the same length of time since their common origin.

Xenology

Homologs resulting from horizontal gene transfer between two organisms are termed xenologs. Xenologs can have different functions, if the new environment is vastly different for the horizontally moving gene. In general, though, xenologs typically have similar function in both organisms.[14]

Gametology

Gametology denotes the relationship between homologous genes on nonrecombining, opposite sex chromosomes. Gametologs result from the origination of genetic sex determination and barriers to recombination between sex chromosomes. Examples of gametologs include CHDW and CHDZ in birds.

See also

Notes

  1. from Greek ομολογειν, 'to agree'

References

  1. 1.0 1.1 1.2 doi: 10.1002/bies.200900175
    This citation will be automatically completed in the next few minutes. You can jump the queue or expand by hand
  2. Cf. Butler, A. B.: Homology and Homoplasty. In: Squire, Larry R. (Ed.): Encyclopedia of Neuroscience, Academic Press, 2009, pp. 1195–1199.
  3. Sattler R (1984). "Homology — a continuing challenge". Systematic Botany 9 (4): 382–94. doi:10.2307/2418787. http://jstor.org/stable/2418787. 
  4. Sattler, R. (1994). "Homology, homeosis, and process morphology in plants". In Hall, Brian Keith. Homology: the hierarchical basis of comparative biology. Academic Press. pp. 423–75. ISBN 0-12-319583-7. 
  5. HOVERGEN: Homologous Vertebrate Genes Database
    Duret L, Mouchiroud D, Gouy M (June 1994). "HOVERGEN: a database of homologous vertebrate genes". Nucleic Acids Res. 22 (12): 2360–5. doi:10.1093/nar/22.12.2360. PMID 8036164. PMC 523695. http://nar.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=8036164. 
  6. HOMOLENS: Homologous Sequences in Ensembl Animal Genomes
    Penel S, Arigon AM, Dufayard JF, et al. (2009). "Databases of homologous gene families for comparative genomics". BMC Bioinformatics 10 (Suppl 6): S3. doi:10.1186/1471-2105-10-S6-S3. PMID 19534752. PMC 2697650. http://www.biomedcentral.com/1471-2105/10%20Suppl%206/S3. 
  7. HOGENOM : Database of Complete Genome Homologous Genes Families
  8. Koonin EV (2005). "Orthologs, paralogs, and evolutionary genomics". Annu. Rev. Genet. 39: 309–38. doi:10.1146/annurev.genet.39.073003.114725. PMID 16285863. http://arjournals.annualreviews.org/doi/abs/10.1146/annurev.genet.39.073003.114725?url_ver=Z39.88-2003&rfr_id=ori:rid:crossref.org&rfr_dat=cr_pub%3dncbi.nlm.nih.gov. 
  9. OrthoMCL: Identification of Ortholog Groups for Eukaryotic Genomes
    Chen F, Mackey AJ, Stoeckert CJ, Roos DS (January 2006). "OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups". Nucleic Acids Res. 34 (Database issue): D363–8. doi:10.1093/nar/gkj123. PMID 16381887. PMC 1347485. http://nar.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=16381887. 
  10. OrthoMaM
    Ranwez V, Delsuc F, Ranwez S, Belkhir K, Tilak MK, Douzery EJ (2007). "OrthoMaM: a database of orthologous genomic markers for placental mammal phylogenetics". BMC Evol. Biol. 7: 241. doi:10.1186/1471-2148-7-241. PMID 18053139. PMC 2249597. http://www.biomedcentral.com/1471-2148/7/241. 
  11. OrthologID
    Chiu JC, Lee EK, Egan MG, Sarkar IN, Coruzzi GM, DeSalle R (March 2006). "OrthologID: automation of genome-scale ortholog identification within a parsimony framework". Bioinformatics 22 (6): 699–707. doi:10.1093/bioinformatics/btk040. PMID 16410324. http://bioinformatics.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=16410324. 
  12. GreenPhylDB
    Conte MG, Gaillard S, Lanau N, Rouard M, Périn C (January 2008). "GreenPhylDB: a database for plant comparative genomics". Nucleic Acids Res. 36 (Database issue): D991–8. doi:10.1093/nar/gkm934. PMID 17986457. PMC 2238940. http://nar.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=17986457. 
  13. Wolfe K (May 2000). "Robustness—it's not where you think it is". Nat. Genet. 25 (1): 3–4. doi:10.1038/7556010.1038/75560 (inactive 2009-12-02). PMID 10802639. http://www.nature.com/ng/journal/v25/n1/full/ng0500_3.html. 
  14. NCBI Phylogenetics Factsheet

Further reading